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16 Abstract ... 

There !* considerable interest in using plant growth simulation models for <arge area yield 
forecasting in the United States and foreign areas. These models Generally W^^ily 
input values of solar radiation, temperature, and rainfall. For calibration and evaluation 
of these models, historical data are needed. However, historical solar data are rarely 
available because solar radiation has been measured at only a few lo "tions, frequently for 
only limited time periods. It would be necessary to develop surrogates for the historical 
data values. 

Five algorithms producing daily solar radiation surrogates usina daily ^^^"^ « nd Thp 
rainfall were evaluated usinq measured solar radiation data for sevw U.S. locations. The 
algorithms were compared both in terms of accuracy of daily solar radiation ""■JJV^J 11 
terms of response when used in a plant growth simulation model (CERtS-wheat) . Requirements 
for accuracy of solar radiation for plant growth simulation models were discussed »» 
algorithm was recommended as being best suited for use in these models when neither measured 
nor satellite estimated solar radiation values were available. 
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INTRODUCTION 
Large area yield forecasting for the United States and foreign 
areas is an application of plant growth simulation models with great 
potential. Forecasts from these models could be a powerful tool for 
agricultural and economic policy makers in both government and industry. 

Daily solar radiation data are required in most models. Until 
recently, solar radiation had been measured at only a few locations 
around the world. Because of the lack of available c'ata, various surro- 
gates for measured solar radiation are being developed and tested for 
use in the models. Recently, satellite estimated solar radiation became 
available for most of the Western Hemisphere on a "real-time" basis. If 
satellite data can be used to produce accurate solar radiation estimates 
one obstacle to using simulation models for large areas or many loca- 
tions would be removed. 

Calibration of a particular crop growth simulation model would be 
needed for area-specific factors: soil fertility, water holding capa- 
city, varietal characteristics, fertilizer and pesticide applications, 
planting practices, and other management practices. This calibration 
also requires historical yield and meteorological data including solar 
radiation data for that area. Because historical solar radiation data 
are not generally available, a surrogate must be used. In this paper 
five algorithms are compared for producing solar radiation surrogates 
from commonly measured daily meteorological variables. 

As used in this paper, a solar radiation "surrogate" will produce 
solar radiation data which is similar to observed data in terms of 
various statistical measures: similar daily mean, similar variability, 
etc. These surrogates are not intended to accurately predict observed 
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ata on any given day. The term "estimate" is reserved for accurate 
ai 1y predictions, and estimates of yearly yield. 

SOLAR RADIATION ALGORITHMS 
Five algorithms producing surrogate measures of solar radiation 
re compared in this study. They are referred to as CE , SR, R0, Rl, and 
R2. The CE algorithm (Cengiz et al, 1981) was developed using data from 
Columbia, Missouri. It is composed of two types of functions. Location 
specific functions require information on latitude. Daily functions 
require the day of the year and daily maximum and minimum temperature 

( T able la). 

The SR, R0, Rl, and R2 algorithms are based on the Richardson 
(1981) weather simulation model (Table lb, c, d, e). The Richardson 
Imodel uses a set of location specific constants to estimate daily rain- 
fall, solar radiation, and maximum and minimum temperature. These 
constants would be available only for locations in the continental 
United States. The solar radiation and temperature values dre estimated 
as daily deviations from annual curves. The annual curves consist of 
long term average daily values. Separate curves are used for dry days 
and for rainy days. Rainy Tays are defined as those days for which 
rainfall has been estimated as being greater than zero. The algorithms 
used in this study modified Richardson's model so that observed tempera- 
tures and rainfall were used to estimate solar radiation. 

The SR algorithm (Table lb) was based only on Richardson's annual 
curves for normal radiation. Separate curves were used for dry days and 
for rainy days. There were no daily deviations from the annual curves. 

To estimate daily deviations from the annual curve values for solar 
radiation, temperature, and precipitation, the Richardson model uses 
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correlations, one day lag correlations and a random component. The 
correlations and one day lag correlations were reported to be approxi- 
mately uniform for the continental U.S. (Richardson, 1981). It may be 
acceptable to extend these correlations to other regions. In the RO 
algorithm, the actual deviations of maximum and minimum temperatures and 
the correlations were used to estimate the daily deviation of solar 
radiation (Table lc). 

The Rl and R2 algorithms also use Richardson's correlations among 
daily deviations of temperature and solar radiation. These correlations 
and the actual daily deviations of maximum and minimum temperatures are 
used to produce daily deviations of solar radiation in the Rl algorithms 
(Table Id). 

Because the daily variability of solar radiation estimated by Rl 
was too small, the daily deviations of SR were amplified for the R2 
algorithm (Table le). The amplification was moderate for deviations 
above the annual curves but greater for deviations below the annual 
curves. Measured data (DOE, 1979) from St. Cloud, MN; Rapid City, SO; 
and Glasgow, MT were used to determine the degree of this amplification. 
Richardson's annual mean values of solar radiation were changed by the 
amplification. The new "annual mean values" were approximately 5% 
greater than the actual values for dry days and 15% greater for rainy 

days. 

OATA 
The SOLDAY (DOE, 1979) data set encompasses the period 1952-1974. 
It consists of measured and rehabilitated (adjusted for known procedural 
and instrumental errors) daily solar radiation values ana associated 
maximum and minimum daily temperatures and rainfall for 27 U.S. 
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Stations. The rehabi li tated solar radiation data were viewed as 
'ground truth" for the purposes of this study. Seven stations (Table 2) 
3[vere selected to compare the five algorithms producing solar radiation 

4 iurrogates for the rehabilitated solar radiation values. Three of the 

5 ieven stations (St. Cloud, Rapid City, and Glasgow) had been used in the 

6 ievelopment of the R2 algorithm. Seven surrogate data sets, one for 
ach station, were developed for comparison with ground truth (Table 2). 

When the rehabilitated and simulated solar radiation values were 
compared over the seven stations, several things became apparent. Some 
of the rehabilitated solar radiation values were obviously too large 
(greater than 85% of solar radiation at the top of the atmosphere). The 
R2 and th2 CE algorithms also occasionally estimated excessively high 
values, the CE algorithm more so than R2. 

The SOLOAY (DOE, 1979) data also included daily values of solar 
radiation at the top of the atmosphere and the percent possible 
transmissivity of the atmosphere for each location. 

For use with the yield model, the rehabilitated solar radiation 
values, the R2 algorithm values, and the CE algorithm values were 
screened for values greater than maximum potential. This maximum was 
defined to be the product of percent possible atmospheric transmissivity 
(%T) and solar radiation at the top of the atmosphere (ETSR). Values 
higher than this maximum were reduced to the maximum. This screening 
algorithm reduced the observed solar radiation values by an average of 
0.7%, the R2 algorithm values by an average of 0.5%, and the CE algorithm 1 
values by an average of 1.0%. Values from the R0, Rl, and SR algorithms 
were not affected by the screening. 
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1 COMPARISON METHODOLOGY 

2 The surrogates of solar radiation could themselves be compared to 

3 the rehabilitated ground truth data to determine which was the best 

4 estimate. These surrogates were developed solely for use in simulation 

5 models, however. Because of this, it was felt that their impact on 

6 yield prediction in the models would be the important criterion. The 

7 best algorithm would not necessarily be the one which produces the best 
g estimates of ground truth but rather would be the one which produces 

g similar yield predictions when used in a simulation model. 
10 The Ceres-wheat model (Ritchie and Godwin, 1983; Otter, Ritchie and 

H Godwin, 1983) was the model selected for comparison of the five solar 

12 surrogates. This program requires initial parameter values for initial 

13 soil water content, soil water retention characteristics, variety of 

14 wheat ( Triticum aestivum L.) planting density and depth, planting date, 

15 ana latitude. 

16 Model estimates were derived using data from three of the SOLDAY 

17 stations: St. Cloud, Minnesota; Rapid City, South Dakota; and Glasgow, 

18 Montana. Both continuous cropping and summer fallowing practices were 

19 used. Median planting dates for each year at each station were estima- 

20 ted using a spring small grains planting date model (Hodges and Artley, 
2i 1981). Daily values of rainfall and maximum and minimum temperature 

22 were required for each station. 

23 The solar radiation input was first supplied by the rehabilitated 

24 ground truth data. Five additional model estimates were generated with 

25 identical inputs for all variables except solar radiation. For each 

26 of these, data estimated using one of the solar radiation algorithms 
07 were used. The model was also run using unscreened values from the 
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ground truth solar radiation and from the R2 and CE algorithms. On the 
average, yields were reduced by less than 1% compared to model estimates 
using screened data. 

The resulting predicted yields using each of the solar surrogates 
could then be compared to yields predicted using the ground truth data. 
The algorithm which led to results most similar to that using ground 
truth data could then be determined. The sensitivity of the model to 
variations between the algorithms could also be studied. 

For the algorithm comparison, the yearly difference (D) between 
ground truth yield predictions (6TY) and predictions using each surro- 
gate (EST) would be calculated: 

D = GTY - EST. 

The arithmetic mean of D would indicate the bias of the yield 
estimates. Smaller bias measures would imply better surrogates. Bias 
values would be calculated for each algorithm for each station for 
continuous cropping and for summer fallow, a total of thirty values. 

It would also be important for the root mean square error, RMSE, to 
be small to indicate that more estimates have a small D value than a 
large one. This statistic is calculated by: 

RMSE = 



=\p/n 



The standard deviation of the D values (SD) is also calculated. 
This indicates what the RMSE would be if the bias were removed. 

Maximum values of D (MAX D) and minimum values of (M1N D) would 
also be compared. As a final measure of the similarity between GTY and 
EST, the Pearson correlation coefficient, CORR, would be determined. 

COMPARISON RESULTS 

Statistics used for comparison of yield predictions using each of | 
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the five solar radiation surrogates are shown in Table 3. Mean and 
standard deviations of yield predictions using the rehabilitated solar 
radiation data (GT) were compared to predictions using each solar 
surrogate- 
In half of the cases, R2 produced yield estimates most similar to 
those using ground truth data. SR was nearest in one third of the cases. 
The CE algorithm had the highest bias genera. ly. The SR algorithm had 
a standard deviation of the estimates nearest to ground truth. R0 was 
the poorest in terms of similar variation. 

The RMSE values indicate that the R2 algorithm produced yield 
estimates which had less variation around ground truth, followed by CE. 
f again was the poorest. If the bias were removed, the SD values show 
that the CE algorithm would have been least variable. R0 was, again, 
the poorest. The bias, being a function of the model's sensitivity to 
solar radiation, would be difficult to remove. 

The range of the data, shown by MAX D and MIN 0, indicates that for 
St, Cloud, MN summer fallow, all of the algorithms except R2 produced 
estimates which were too low in all of the years. For other areas, t'.e 

ranges are comparable. 

The correlation values indicate the closest correspondence between 
CE yield estimates and ground truth. R0 and R2 did poorest using this 
criterion. 

Although "best" and "worst" surrogates could be detected, all were 
very close. Each would be acceptable in terms of the correlation of 
their predicted yields with those yields predicted using the 
rehabilitated solar radiation data. Differences between indicators for 
summer fallowed and continuous cropped were generally negligible. 
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DISCUSSION 

The magnitude of day to day variability of solar radiation would be 
more critical than day to day accuracy. This is due to the strong 
effect of solar radiation on the modeled soil water balance. When ample 
soil moisture is available, evaporation occurs at an "energy-limited" 
rate proportional to the energy available from solar radiation. When 
the modeled water content of the soil surface is depleted more than a 
certain amount (U), direct evaporation of water from the profile 
(excluding transpiration) occurs at a rate roughly proportional to the 
square root of the number of days on which drying has occurred. This 
"time-limited" evaporation rate is generally much lower than the 
energy-limited evaporation rate. On rainy days, 4/5 of the rainfall is 
available for evaporation at the "energy-limited" rate even if the sur- 
face water depletion is greater than U. On the next dry day, moisture 
that has entered the profile is evaporated at the "time-limited" rate if 
the surface water depletion is greater than U. Consider that in a dry 
situation when a small amount of rainfal. occurs, moderate or high 
solar radiation will cause near total evaporation. However, low solar 
radiation will allow most of the rain to enter the soil profile and be 
subject to "time-limited" evaporation. Thus, on two days with small 
amounts of rainfall, aaily solar radiations of 700 and 100 langleys 
respectively would allow considerably more water to enter the profile 
than would two days of 400 langleys each. 

In the Ceres-wheat model, carbon fixation is affected by solar 
radiation in a nonlinear fashion. For radiation amounts to 467 
langleys/day of intercepted light, carbon fixation is proportional to 
light. At higher light intensities, no additional carbon is fixed. 
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Uniformly moderate solar radiation (as opposed to highly variable solar 
radiation) will result in more biomass accumulation, more leaf growth, 
more water use, and under moist conditions, higher yield. However, with 
dry conditions, more water stress and lower yield will result. 

CONCLUSIONS 

Although the differences between algorithms were small, the bias 
and root mean square error indicated that the R2 algorithm would be 
recommended as a solar radiation surrogate for use in simulation models. 
When used in the Ceres-wheat yield model, the R2 solar radiation surro- 
gate produced yield predictions closest to those using ground truth 
solar radiation data. The CE algorithm also produced close estimates, 
but had a larger bias which would be difficult to remove as it is a 
function of the model's sensitivity to solar radiation. 

The R2 algorithm would also be recommended for use in foreign areas. 
The location specific coefficients for the R2 algorithm can be derived 
from long term average monthly solar radiation values; these would be 
available world-wide (de Jong, 1973). Only an assumption about the 
average difference between solar radiation on rainy days and on dry days 
for a location would be needed for use of this surrogate. Because of 
this, the R2 algorithm would be recommended for use in areas for which 
neither measured nor satellite estimated solar radiation values are 
available. 
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Table 1. Equations for Solar Radiation Simulation Algorithms 


2 


a. CE Algorithm (Cengiz et il. 1961) 


3 


Solar Radiation = 49.03 * .1 *F 1 S - 7.26 *DBR 


4 


+ .06 -• FIS * DBS 


5 


Location specific functions: 


6 


S = Sin (Latitude * n/180.) 


7 


T = Tan (Latitude * n/180.) 


6 


C = Cos (Latitude * n/180.) 


9 


SLD = Arccin ((.5 + .007895/C + .2168875 *T)V2) * 180./ n 


10 


SN = Sin ( n* SLD/24.) 


11 


A = (S * (46.355 * SLD - 574.3885) * 816.41 * C * SN) 


12 


* (.29 * C + .52) 


13 


B = (S *(574. 3885-1. 509 * SLD) - 26.59 * (C*SN) * (.29 * C+.52)) 


14 


Daily Functions: 


15 


SI = Sin (2 n/365. * (JULIAN DATE + 10.5) - 1.5708) 


16 


FIS = A + B * SI 


17 


DBR = (TX - TN)* 5/9 


18 


b. SR Algorithm (Richardson, 1981) 


19 


Solar Radiation = RM (I) + AR * cos (.0712 * (Julian Date - 172)) 


20 


RM (1) = Annual mean solar radiation for dry days 


21 


RM (2) = Annual mean solar radiation for rainy days 


22 


AR = Amplitude of annual solar radiation curve 


23 


For dry days, 1=1; for rainy days, I = 2 


24 
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Table 1 (continued) 
RO Algorithm (largely based on Richardson, 1981) 

Solar Radiation = SRL * SRSD + SRBAR 

If Solar Radiation < 0.0 then Solar Radiation = 0.0 
If Solar Radiation > 770. then Solar Radiation = 770. 

Location specific constants: 

TXM (1) and TXM (2) = mean annual maximum daily temperatures for 
dry days (1) and wet days (2). 

ATX = Amplitude of annual curves (dry day and wet day) daily 
maximum temperature 

CVTX = coefficient of variation of daily deviations of maximum 
temperature from annual curves 

ACVTX = coefficients of variation of ATX 

TNM ■ mean annual daily minimum temperature 

ATN = Amplitude of annual curve of daily minimum temperature 

CVTN = coefficient of variation of daily deviations of minimum 
temperature from annual curve 

ACVTN = coefficient of variation of ATN 

RM (1) and (2) = mean annual daily solar radiation for dry days 
(1) and rainy days (2) 

AR = amplitude of annual curves of daily solar radiation 

CVR (1) and (2) = coefficients of variation of daily deviations 
of solar radiation from annual curves for dry days (1) and 
for rainy days ACVR (1) and (2) = coefficients of variation 
of AR for dry days (1) and for wet days (2) 
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Table 1 (continued) 
Daily functions: 

SRSD = ABS (SRBAR * (CVR (1) ♦ ACVR (I) * OR)) 

SRBAR = RM (I) t AR * DR 

A and 3 are matrices (3 x 3) derived by Richardson to describe the 

intercorrelations between daily maximum and minimum temperatures 

and solar radiation in the continental United States. 

I = 1 for dry days or 1 = 2 for rainy days 

ASRL = A (3,1) * PTXL + A (3,2) * PTNL + A (3,3) * PSRL 

where PTXL, PTNL, and PSRL are TXL, TNL, and SRL values from 
the previous day 

OT = Cos (.0172 * (Julian date - 200)) 

DR = Cos (.0172 * (Julian date - 172)) 

TXBAR = TXM (I) + ATX * .OT 

USD = ABS (TSBAR * (CVTX + ACVTX * DT)) 
The above equations are from the Richardson (1981) weather simulation. 
The following five equations were developed to adapt the weather 
simulator to estimate only solar radiation: 

SRL = ASRL ♦ B (3,1) * TXL + B (3,2) * TNL 

TXL = (TX - TXBAR)/TXSD 

If TXL > 1.5 or TXL < -1.5 then TXL = 0.0 

TNL = (TN - TNBAR)/TNSD 

If TNL > 1.5 or TNL < -1-5 then TNL = 0.0 

Rl Algorithm 

Same as RO algorithm except: 

SRSD = .25 * SRBAR 

TXSO = 9. 

TNSD = 9. 
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Table 1 (continued) 


2 

e 


R2 Algorithm 


3 


Same as RO algorithm except: 


4 


Solar Radiation = Noise * SRL * SRSO ♦ SRBAR 


5 


SRSO = .1 * SRBAR 


6 


TXSD = 14. 


7 


TNSO = 14. 


8 


where Noise = 4.4 for SRL > 0.0 on dry day. 


9 


= 11.44 for SRL >_ 0.0 on rainy days 


10 


= 13.2 for SRL < 0.0 on dry days 


11 


= 34.32 for SRl £ 0.0 on rainy days 


12 


If Solar Radiation > 770.1 y/day then Solar Radiation = 770. 


13 


RM (1) and RM (2) should b3 approximately 5% and 15% greater than 


14 


the actual annual mean daily solar radiation for dry days and 


15 


rainy days respectively. 
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Tab]* ?. Statistics for comparison of solar radiation estimates to ground 

measured solar radiation (GT) 
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GT 



Dry Day s Wet Days 

CE R0 Rl R2 SRBAR GT CE RO fcl R5" 



n 421. 376. 413. 412. 423. 412. 258. 294. 241. 242. 257. 
801. 801. 727. 769. 801. 663. 801. 801. 637. 584. 792. 



803. 315. 724. 763. 783. 660. 801. 817. 680. 629. 816, 
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